Apparatus and methods for locating data

ABSTRACT

The present invention provides for quick and efficient searches. In one embodiment, a search system, comprises an email search interface having at least a first email-specific attribute search field, a file search interface having at least a first file-specific attribute search field, a Web history search interface having at least a first Web-specific attribute search field, and an apparatus configured to perform incremental searching as the user enters characters into one or more attribute search fields.

RELATED APPLICATIONS

This application is related to copending application entitled METHODSAND SYSTEMS FOR SEARCH INDEXING, Ser. No. 10/654,588, and copendingapplication entitled METHODS AND SYSTEMS FOR WEB-BASED INCREMENTALSEARCHES, Ser. No. 10/654,596, filed on the same date as the presentapplication, the entirety of which are hereby incorporated by reference.

PRIORITY CLAIM

This application is a continuation of U.S. application Ser. No.10/654,595, filed Sep. 3, 2003, now U.S. Pat. No. 7,496,559 and entitled“APPARATUS AND METHODS FOR LOCATING DATA,” which claims the benefitunder 35 U.S.C. 119(e) of U.S. Provisional Application No. 60/408,015,filed Sep. 3, 2002, U.S. Provisional Application No. 60/413,013, filedSep. 23, 2002, U.S. Provisional Application No. 60/448,923, filed Feb.20, 2003, U.S. Provisional Application No. 60/470,903, filed May 14,2003, and U.S. Provisional Application No. 60/478,690, filed Jun. 13,2003, the contents of which are incorporated herein by reference intheir entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates generally to data processing, and inparticular to systems and methods for locating data.

2. Description of the Related Art

Conventional search application programs often are slow and cumbersometo use. For example, many search engines require the user to type in asearch term, click on a search button, and review the results. If theuser is not satisfied with the results, as might be the case if too manyitems or items that are not of interest are found, then the user needsto edit the search terms, click on an initiate search button, and againreview the results. In addition, many search application programs onlyprovide for limited search filtering, wherein the user is limited tofiltering the search based on dates, file location, and file contents.

Another drawback of some search engines is that the actual searchprocess takes an annoying amount of time to complete a search.Furthermore, some search engines do not include search features directedto specific types of search targets, such as email, Web pages, andfiles. Alternatively and inconveniently, separate tools may be needed tosearch for email, Web pages, and files if the user desires correspondingspecific search features.

Further, with many Web-based search systems, the user types in a searchphrase in a Web page search field presented via a browser on the user'scomputer screen, and then presses an Enter key or Search button to havethe query transmitted to a remote server system that will perform thesearch. The server processes the query, generates a list of searchresults, and sends the list back to the user's browser. The user thenclicks on one of the choices, whereupon a target page is displayed. Theprocess from the time the user starts typing, until the user sees thetarget page, can take at least 5 seconds, and often takes 30 seconds ormore.

Because of the turnaround time for typing in a search phrase toreceiving search results, this conventional search procedure can demanda deliberative and tedious strategy from the user in order to searchefficiently. First the user may think carefully regarding what searchterms to use, the user then enters the search term, initiates thesearch, after several seconds receives and reviews the list of results,decides which, if any of the results seem relevant enough to click on,click on a selected list item, wait several seconds for thecorresponding Web page to be returned and displayed on the user'sbrowser, and finally reviews the Web page.

SUMMARY OF THE INVENTION

One example embodiment of the present invention advantageously providesfor incremental or reactive searching of a variety of search targets.For example, the search targets can include one or more of files,emails, email attachments, Web pages, specific databases, and/or thelike. Because a search is performed incrementally, the search resultsare provided or narrowed substantially immediately after each characterin a search string is entered by a user. Thus, the user is provided withsubstantially immediate feedback as the search string is being entered,and so can quickly decide on the desirability of entering additionalsearch characters, entering a new search string, or deleting one or moresearch string characters.

An example search application generates one or more indexescorresponding to the search target types. An index optionally includesboth content information and attribute information for potential searchtargets. The attribute information can be specific to the type of searchtarget. For example, the index can include at least partially differentattribute information for files as compared to emails. The indexing isoptionally performed incrementally.

In one embodiment, the search application includes search interfacescorresponding to different types of searches. For example, the searchinterfaces can include an email search interface, a files searchinterface, a Web search interface, a favorites search interface, and/ora specific database search interface. Thus, in one example embodiment,optionally a single search application advantageously provides searchmode specific interfaces for different types of search targets.

Another embodiment is a search apparatus, comprising: a plurality ofuser selectable search mode interfaces, including at least: an emailsearch mode interface having email-specific attribute search fieldsincluding at least a date field, a from field, and a sender field; afile search mode interface having file-specific attribute search fieldsincluding at least a file name field, a file type field, a date field, afile size field, and a path field; a favorites search mode interfaceused to search Web pages designated by a user as a favorite Web page; aWeb history search mode interface having a date field, a title field, asize field, and a search term field; an email attachment search modeinterface having a name field, a date field, a size field, and anextension field; wherein each of the email search mode interface, thefile search mode interface, the favorites search mode interface, the Webhistory search mode interface, and the email attachment search modeinterface further comprises a list pane, used to display a list ofsearch result items, and a view pane, used to display the contents of aselected search result item. In addition, also included is an indexmodule configured to generate an email index, a file index, and a Webpage index, as well as a search module configured to perform incrementalsearching of at least one of the email index, the file index, and theWeb page index in response to the user entering characters into at leastone search mode interface field.

Another embodiment is a search apparatus, comprising: a plurality ofuser selectable search mode interfaces having attribute fields,including at least: an email search mode interface having email-specificattribute search fields; a file search mode interface havingfile-specific attribute search fields; a Web history search modeinterface having Web page-specific attribute search fields; an emailattachment search mode interface having email attachment-specificattribute search fields. The search apparatus further comprises an indexmodule configured to generate at least a first index, including one ormore indexes of email, files, and Web pages, and a search moduleconfigured to perform incremental searching of the first index as theuser enters characters into at least a first attribute search field.

In still another embodiment, a search system comprises an email searchinterface having at least a first email-specific attribute search field,and a file search interface having at least a first file-specificattribute search field. The search system further comprises an apparatusconfigured to perform incremental searching as the user enterscharacters into the attribute search fields.

One embodiment is a method of locating information, comprising:receiving a first search character in a first attribute search field;immediately providing first search results in response to receiving thefirst search character; receiving a second character in the firstattribute search field; immediately providing second search results,narrower than the first search results in response to receiving thesecond search character, wherein the second search results includesentries corresponding to documents having at least a first characterstring including the first and second characters; receiving a thirdsearch character in a second attribute search field; immediatelyproviding third search results narrower than the second search resultsin response to receiving the third search character; receiving a fourthcharacter in the second attribute search field; and immediatelyproviding fourth search results, narrower than the third search resultsin response to receiving the fourth search character, wherein the fourthsearch results includes entries corresponding to documents associatedwith at least a first character string including the first and secondcharacters and a second character string including the third and fourthcharacters.

In one embodiment, a search system comprises: a first target searchinterface including a plurality of target-specific attribute searchfields that are displayed at the same time, a view area, and a listarea. The search system further comprises an apparatus configured toperform incremental searching as the user enters characters into one ormore of the target-specific attribute search fields and to provide alist of search results in the list area and content information in theview area.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present invention will now be described withreference to the drawings summarized below. These drawings and theassociated description are provided to illustrate example embodiments ofthe invention, and not to limit the scope of the invention.

FIG. 1 illustrates an example computer system that can be used inaccordance with one embodiment of the present invention.

FIG. 2A illustrates an example index architecture.

FIG. 2B illustrates an example search process using the index.

FIG. 2C illustrates an example indexing process.

FIG. 2D illustrates an example use of a pre-computed sort list.

FIGS. 3A-I illustrate example user interfaces in accordance with oneembodiment of the present invention.

FIGS. 4A-4F illustrate example search processes.

FIG. 5 illustrates an example computer system that can be used toprovide a server based search system in accordance with one embodimentof the present invention.

FIG. 6 illustrates an example search process using the server basedsearch system.

FIG. 7 illustrates an example process of scrolling through a searchlist.

Throughout the drawings, like reference numbers may be used to refer toitems that are identical or functionally similar.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Embodiments of the present invention provide systems and methods forsearching, index, and presenting information. As will be described ingreater detail below, in one example embodiment, a search apparatusprovides multiple search fields corresponding to different search targetcharacteristics. As a user enters in a search term into a field, thesearch results are refined incrementally with each entered character.Thus, the search results are shown as the user enters search charactersor terms. To further refine or filter the search results, the user canenter other search terms into other fields. In one embodiment, thefiltering process is performed substantially immediately upon the userentering addition search characters or terms.

Further, a user can incrementally broaden the search results by deletingor backspacing over previously entered search characters. The searchresults are broadened substantially immediately, in response to the userdeleting or back spacing over a character, without requiring that theuser click on or otherwise activate a “search” button or the like. Asearch, for example, can typically take 200 ms or less, though in somesituations it can take longer, such as, for example, 300 ms, 400 ms, 600ms, 1 second, or 2 seconds.

In one embodiment, a user interface includes a list area or pane, whichlists the search results (optionally including headlines or summaries),and a view area or pane, which displays an item or contents of an item,such as an item selected in the list area. Advantageously, the user cancustomize an example search system in accordance with the user'spreferences.

Throughout the following description, the term “Web site” is used torefer to a user-accessible server site that implements the basic WorldWide Web standards for the coding and transmission of hypertextualdocuments. These standards currently include HTML (the Hypertext MarkupLanguage) and HTTP (the Hypertext Transfer Protocol). In addition,reference is made to Java script (also referred to as JavaScript),though other types of script, programming languages, and code can beused as well. It should be understood that the term “site” is notintended to imply a single geographic location, as a Web or othernetwork site can, for example, include multiple geographicallydistributed computer systems that are appropriately linked together.Furthermore, while the following description relates to an embodimentutilizing the Internet and related protocols, other networks, such asnetworked interactive televisions, and other protocols may be used aswell.

In addition, unless otherwise indicated, the functions described hereinare preferably performed by executable code and instructions running onone or more general-purpose computers. However, the present inventioncan also be implemented using special purpose computers, state machines,and/or hardwired electronic circuits. The example processes describedherein do not necessarily have to be performed in the describedsequence, and not all states have to be reached or performed.

Further, while the following description may refer to “clicking on” alink or button, or pressing a key in order to provide a command or makea selection, the commands or selections can also be made using otherinput techniques, such as using voice input, pen input, mousing orhovering over an input area, and/or the like.

One example search system provides for incremental searching across aplurality of attribute fields for files, emails, email attachments, theInternet, other networks, prior Internet or other network search results(“history”), favorites, specific databases, and/or the like. In oneexample embodiment, the search system includes a client-based searchapplication. Other embodiments can utilize a server-based searchapplication.

One example search application includes tabs corresponding to an emailsearch interface, a files search interface, a Web search interface, afavorites search interface, an email attachment interface, a Web historysearch interface, and/or a specific database search interface. Thus, inone example embodiment, a search application provides search specificinterfaces for different types of search targets.

Optionally, all the tabs are efficiently displayed and selectable in aplurality screens. Thus, while viewing a first target search interface,a user can select a different search target interface by moving thecursor over a given tab and then clicking on the tab. In anotherembodiment, a list of interface links or a drop down menu can be used toselect the target specific search interfaces. In addition, other methodsof selecting the search target interfaces can be used as well. For easeof use, the different interfaces optionally have similar layouts withrespect to the positioning of may of the interface elements, such as thelist and view areas, the tabs, and main search field.

FIG. 1 illustrates example hardware and software components on aphysical and/or functional level that can be invoked or used whenperforming a search and/or an indexing operation. As depicted, oneembodiment includes a search application 102 stored on and executed by auser terminal 104. As will be described in greater detail below, thesearch application 102, provides user interfaces for searching email,files, Web sites, cached Web pages, databases and the like. In addition,the search application 102 has a local index engine 103 that indexesemail, files, cached Web pages, databases and the like, stored in a datarepository or database 105. For example Web pages previously viewed inthe search application's view pane or area, and optionally, stored Webpages previously viewed using other user browsers, or otherwise storedlocally can be indexed. Separate indexes can be used for the email,files, cached Web pages, databases and the like, or a single index canbe used for the foregoing.

The index engine 103 can further include code configured as a scanengine or module that is used to determine whether a file is to beindexed. Thus, the index engine 103 can also scan files to identify newtargets, such as email, document files, Web pages, database entries, andthe like, that have not yet been indexed, or targets that have beenpreviously been indexed but have since been modified. Optionally, ratherthen re-index all corresponding targets each time an index operation isperformed, the index engine 103 can incrementally index just the new ormodified targets or documents.

The index engine 103 can utilize one or more indexing algorithms tocreate an index, such as a reverse or inverted index. The index includesa data structure that associates character strings with files,documents, and the like. In one example embodiment, for each word orcharacter string found with a file or document, the index stores whichfields of which documents or files contain that word or characterstring.

In one example embodiment, the index is constructed in blocks, which forconvenience will be referred to as incremental index bulks or bulks. Inthis example, there is an incremental index bulk for each indexed targettype, such as files, email, favorites, email attachments, stored Webpages. By way of example, a bulk can be 4 Mbytes in size, 16 Mbytes insize, or other sizes. A given bulk can be a memory mapped block ofvolatile memory, such as DRAM or SRAM, in which list structures for thecorresponding index are built. The bulks can be stored on slower massmemory, such as a hard disk drive, and then read into faster RAM uponstartup of the search application. Advantageously, having the bulkmemory mapped into RAM allows for fast searches and updating, ascompared with having to translate the information from a disk storageformat to a different RAM format.

When the bulk fills up so that the allocated memory space for the bulkis filled or substantially filled by the index data, the bulk is storedin non-volatile mass memory, such as a magnetic drive, and is termed afixed index. The complete index for a given target type can includeseveral fixed indexes and the bulk. As new documents or files are addedto the index, they add to the bulk, rather than directly to the fixedindexes. Optionally, the bulk can be stored in mass, non-volatilememory, such as a hard disk drive, and read into RAM as needed.

In the example embodiment, when a predetermined number of fixed indexes,such as 5, are created the multiple fixed indexes are optionally mergedinto a smaller number of indexes, such as single relatively larger fixedindex. Advantageously, this permits searches to be completed inrelatively less time, as fewer indexes, and optionally only one index,needs to be searched. Search speed is much more closely correlated withthe number of indexes that need to be searched, as compared to the sizeof the index, and therefore it takes less time to search one large fileas compared to 5 or 10 other files that are in aggregate the same sizeas the one large file.

After a merge process is performed, new items are indexed and added tothe corresponding bulk, and additional indexes are stored innon-volatile mass memory, as similarly described above. Generally, thefirst fixed index is significantly larger than the other fixed indexes,as the first fixed index is the merged composite of previous fixedindexes, while the additional fixed indexes are each one bulk in size.

In one embodiment, a fixed index can include a plurality of separatefiles, though in other embodiments, a single file containing the same orequivalent data can be used. For example, a “col” file stores attributedata for the target files corresponding to the index. As will bediscussed below, the attributes can be used as search criteria and canbe displayed to the user in column format. For example, with respect toemail a user interface for searching emails can include a from (sender)attribute column, a to (addressee) attribute column, date/time sentand/or received attribute columns, a folder path attribute column, aclient attribute column, a cc attribute column, a size attribute column,and a bcc attribute.

The fixed index can further include an “occur” file, which is the maininverted index data file. A “occur-offsets” file can also be generated.The occur-offsets file stores pointers into the “occur” file. The fixedindex additionally includes a “words” file that comprises a list of thewords that occur in the target files. Additionally there is a“deletions” file that lists indexed documents or files that have sincebeen deleted. A “body” file can also be generated which has the documenttext. The body file content is used for view pane display when theactual document or document is not available or may become unavailablewhen the content is needed for display in the view pane. The body filecan be particularly useful for more transient files, such as emails. Theexample bulk includes a words file, an occur file, and an occur offsetsfile.

FIG. 2A illustrates an example index configuration, though other indexconfigurations, types and structures can be used in other embodiments.Each target type can be associated with its own corresponding set offixed indexes and bulk. Advantageously, having separate sets of indexesfor each target type, search and indexing can be performed faster.However, in other embodiments, the same index or index set can be usedfor all target types. The example index configuration illustrated inFIG. 2A includes three fixed indexes 204A, 206A, 208A stored on massstorage media 202A, which can be, for example a magnetic or optical diskdrive. In this example, fixed index 204A is a merged index and is largerthan indexes 206A, 208A which are non-merged index. Index 208A is thefinal index, that is, the index with the most recent additions and/ordeletions. The final index 208A includes the information from the bulk212A. The bulk 212A includes a words file, an occur file, and an occuroffsets file. Advantageously, the bulk 212A is stored in RAM 210A sothat additions and deletions can be made very quickly, without requiringconstant modification of the final index 208A stored on slower massstorage media 204A. Optionally, all the indexes can be read into RAM210A to increase search speed. Upon shut-down of the search application,the bulk 212A is written back to the index 208A for storage on thenon-volatile mass memory media 204A.

FIG. 2B illustrates an example search lookup operation. At state 202Bone or more search characters or strings are received. At state 204B thecharacter or character strings are located in the words file. At state206B the pointer or pointers for the character or character strings inthe occur-offsets file are located. At state 208B document-occurrenceinformation is read from the occur file using the located pointers. Atstate 210B the search results are presented to the user. The searchresults are displayed in the list pane or area, and the view area orpane displays the contents of the first item or a selected item from thelist area.

Optionally, the content displayed in the view area or pane isauto-scrolled down to the first matching character or character string,which may be highlighted or otherwise emphasized. The search term can beautomatically displayed with multiple preceding and succeeding lines orsentences. For example, a search for “matisse” might find a documentwhere the first appearance of “matisse” occurs at the 100^(th) linedown. The auto-scroll process automatically scrolls down to, orotherwise displays the content beginning at a certain point relative tothe first appearance of “matisse.” For example, the content can bedisplayed in the view pane from line 90 to line 110, substantiallyimmediately upon selecting that page in the list pane, so that the usercan immediately see the matching word “matisse” in context. Optionally,the user can specify how many sentences or lines are to be displayedbefore and after the search term.

Optionally, the view area or pane displays the content in a collapsedformat, so that several matches that are distributed throughout a vieweddocument can be seen at once, with one or more lines or sentences ofcontext above and below each match displayed as well, and the remaininglines deleted or “collapsed” into “ . . . ” types of lines, with +symbols for expanding the collapsed section to view that section or theentire document in uncollapsed format, and with − symbols to recollapsea section as desired.

In one optional embodiment, the index stores “prefix” entries, whichfurther enhances the speed of incremental filtering or searching. Forexample, in addition to bare or complete words, such as “dog,” commonprefixes, such as “d” or “do”, are stored as well. The “d” prefix entrycontains information indicating which documents or files contain a wordor character string starting with “d”. By preparing this informationduring indexing, the response for the first letter or two of a filteringor search operation is greatly enhanced as compared to a standardinverted index, which would have to locate thousands of matches at thetime of receiving the user query, thereby resulting in slow filteringoperations.

Optionally, a “sort” file is provided for each attribute column. Thesort files are cached sort orders for the documents or files in thefixed index, when sorted by that column. In one embodiment, the sortorder for each column is pre-computed before receiving a user sortinstruction, so that when the user provides a sort order, such as byclicking on a column header to sort, the sort operation is performedalmost instantly, in a few seconds or a fraction of a second, ratherthan in minutes. Optionally, the sort files can be cached in local RAMto speed up access of the sort files. The user can optionally specifythat a particular attribute is to be used as a default sort key whenpresenting a document list. Thus, for example, the user can specify thatas a default sort for email, emails should be automatically sorted andordered according to date/time received. In this example, thepre-computed default sort list, also referred to as a master sort list,would then be sorted according to date/time, and the master sort listwould be almost instantly presented with the pre-computed sort list whenthe user clicks on the email tab.

Optionally, a pre-computed sort list for a plurality of documents orfiles, or for every document or file in a selected search tab orinterface is stored in high speed local memory RAM, sorted by thecurrent sort order. The search engine can then search through this listwhen additional search terms are added. Advantageously, this enables thesubstantially instant redisplay of a given search result list in theselected order when the search results change as the result of a letterbeing typed or removed from the filtering phrase. Without the mastersort list the incremental filtering might appear to operate much moreslowly. In another embodiment, the column information is compressed sothat all or a large portion of the column information fits in RAM. Thecolumn information can then be sorted using memory-sort algorithms,rather than pre-computing the sort order. In one example embodiment, thecolumn information is compressed using a unique-string dictionary thatincludes each unique value that occurs in each attribute column file.The attribute column file contains pointers to items within thisunique-string dictionary, rather than the strings themselves.

In one example embodiment, the pre-computed sort list is generated andused as follows. Referring to FIG. 2D, a sort process begins at state202D, where a matching item list is retrieved or accessed from memory.At state 204D certain information in the matching item list is storedusing a bit mask that optionally contains a bit for every item in acorresponding master sort list, that is for each indexed item for agiven target type. The bit indicates whether the item in the master sortlist is a matching item or not, that is, whether the item is included inthe matching item list. For example, each bit can be set to either a “1”to indicate the item is a match, or a “0” to indicate no match. Forexample, if there are 1,000,000 indexed items in the master sort list,then the bit mask would use 1,000,000/8 bytes=125 k. Advantageously, useof the bit mask often significantly reduces that time it takes to locatematches as a quick bit-check operation can be performed on a given bitmask bit position, rather than necessitating a search through all or alarge portion of the master sort list.

At state 206D the bit for an item in the bit mask is read to determinewhether the master sort list item corresponding to the bit is a matchingitem or not. If the item is a matching item, the process proceeds tostate 208D at which the item is added to the sorted list results and theprocess proceeds to state 210D. Otherwise, the process from state 206Dto state 210D. At state 210D a determination is made as to whether theend of the master sort list has been reached. If not, the processproceeds back to state 206D and examines the next bit in the bit maskcorresponding to the next item in the master sort list. If the end ofthe master sort list has been reached, the process proceeds from state210D to state 212D.

In one embodiment, a bit mask is generated using the index fileinformation. For example, there can be one bit per document, indicatingwhich documents contain each word in the search phrase. Then Booleanoperations (for example, AND, OR, NOT, XOR, etc.) can be used to combinethe bit masks for some or all of the fixed indexes, and for some or allthe different words or character string, to get a bit mask of thematching results.

Referring back to FIG. 1, the database or repository 105 may beconfigured as multiple files and/or databases stored on an internal orexternal storage device of the user terminal 104. A given index caninclude the item content, and the time, date, item size, or otherattributes from when the index entry was created and/or last updated.The some or all of the foregoing index information can be used todetermine if a file has been previously indexed or modified since it waslast indexed, therefore needs to be indexed or re-indexed.

The search application 102 further includes a local search engine 107that utilizes one or more of the indexes created by the index engine 103to locate email, files, cached Web pages, databases and the like inresponse to a user query. For example, in response to a query the localsearch engine 107 accesses the index 105 and locates correspondingmatches. The search engine 107 returns excerpts or headlines fromrelevant matches, and also returns links to the matches. In addition,the search application 102 optionally passes user queries to one or moreremote search engines, receives the results, and displays the results tothe user. In one embodiment, the search application 102 includes commandline code that determines whether a user has entered a command into asearch field, and if so, causes the command to be executed. While thesearch engine 107 and the index engine 103 are illustrated as separateentities, in one embodiment, they are combined in a single engine ormodule.

The search application 102 can be downloaded or accessed from a remoteWeb site, loaded via removable, optical, magnetic, or semiconductormemory, pre-loaded on the terminal's mass storage apparatus, orotherwise stored and/or made accessible to the terminal 104.

The terminal 104 can be a personal computer, an interactive television,a networkable programmable digital assistant, a computer networkablewireless phone, and the like, that optionally has access to the Internetvia a network interface. The terminal 104 can include a display,keyboard, mouse, trackball, electronic pen, microphone (which can acceptvoice commands), other user interfaces, speakers, semiconductor,magnetic, and/or optical storage devices.

In addition to the search application 102, the user terminal 104 can runone or more commercially-available Web browser applications 106 such asMicrosoft Internet Explorer® or Netscape Navigator®, which implement thebasic World Wide Web standards such as HTTP and HTML.

The terminal 104 can also host and/or execute a commercially availablee-mail application 109, such as Microsoft Outlook® which may be used toreceive, send, and display emails. The e-mail application 109 and thebrowser 106 may be integrated with one another, and/or may be integratedwith other application programs or the terminal operating system.

In the embodiment described herein, a Web site 108 hosts a remote searchengine 110 executing on a computer system 111. The computer system 111includes one or more content and index databases 112. The Web site 108and search engine 110 are accessible to the search application 102 viathe Internet 113. The Web site 108 may optionally include content thatspans multiple Internet domains, and/or may be implemented usingphysical servers that are geographically remote from one another. Inother embodiments, the Web site 108 may be in the form of an intranetsite, in which case the terminal 104 may be coupled to the site by aprivate network. For example, Web site 108 may be in the form of aninternal corporate store site for company employees.

In other embodiments, the Web site 108 may be replaced with another typeof network site. For example, the various services described hereincould alternatively be implemented on a hypertextual site or browsingarea of an online services network such as America Online® or MSN®, inwhich case users may access the site using software that implementsnon-standard document formats and transfer protocols.

In addition, there can be several independent search engines associatedwith different Web sites, accessible to the terminal 104 via one or morecomputer networks.

As further depicted by FIG. 1, the Web site 108 includes acommercially-available Web server application 114. The Web serverapplication 114 accesses databases used to generate Web pages inresponse to queries from end users, including queries submitted via thesearch application 102. In addition, the Web server application 114 canrespond to the browser's request for a page, and deliver the page to theWeb browser 106 through the Internet 113.

By way of example, the search engine 110 may search server-baseddatabases, such as databases 112, of the full text of Web pages selectedfrom selected Web pages. By way of example, the databases 112 can begenerated using a spider or the like that “crawls” the Internetsearching for Web pages, which are then stored in the databases 112, andthat follow the links in those Web pages to other Web pages, which arethen stored in the databases. Because the databases 112, or portionsthereof, may have been generated before a user submits a search query,these databases may not be fully current. When the search engine 110performs a Web search in response to a user query, the search enginereturns excerpts from relevant Web pages as stored in a correspondingdatabase, and also returns links to the current version of the Webpages, if such exists.

In addition, a Web page may be added to the search engine databases 112by directly entering a URL. Optionally, the URL may be submitted by auser or someone associated with the URL. Once a Web page is stored, theWeb page is then indexed by the index engine 113. The indexing processmay include Web page text, links, and other content. The index is storedin or in association with the search engine database 112. When a usersubmit a query or search terms, the search engine 110 searches the indexbased on the query or search terms, including Boolean terms. The searchengine 110 locates matches, and links to the corresponding Web pages aretransmitted to the user terminal 104 for display. While the searchengine 110 and the index engine 113 are illustrated as separateentities, in one embodiment, they are combined in a single engine ormodule.

Optionally, certain types of Web pages and links can be excluded frombeing indexed. For example, optionally a Web site owner can request thatthe Web site pages are not to be included in the search engine database.

In addition the search application 102 includes a search applicationoptions form, via which the user can select the search application 102to be configured in one of a docked mode, docked with auto-hide mode,and a “normal” Windows mode. If docked mode is selected, and assumingthe search application 102 has been launched, a portion of the searchapplication interface, such as a control bar, will constantly be visibleat a fixed location of the user's display, such as on the top, left,right, or bottom of the user display. The control bar can include thesearch tabs discussed above, a main search field, a clear entry button,used to clear the main search field in response to a single click oractivation, and a “go” button, used to initiate a search in thoseapplications where the actual typing in of the search characters doesnot initiate the search. For example, when performing a search over theInternet, the user may optionally need to click on the go button toinitiate the search. An example control bar is illustrated in FIG. 31.

If the docked with auto-hide mode is selected, a portion of the searchapplication interface will be at fixed location of the user's display,wherein the portion will visibly appear as a control bar when the usercauses the cursor to point to the portion of the screen where theportion is docked for longer then a predetermined amount of time, suchas 0.5 seconds. If the user moves the cursor off the control bar withoutperforming a search, the control bar disappears, freeing screen spacefor use by other programs. If Windows mode is selected, the searchapplication 102 operates as a conventional Windows program, whereinthere is a non-docked, non-autohide window associated with the searchapplication 102.

The option form also allows the user to set the search application 102to start in email, files, Web, history, favorites, or database searchmode.

Example search interfaces for different search targets, such as emails,files, the Web, and the like, will now be described. By way of example,the email search fields can include one or more of a content field, afrom (sender) field, a to (addressee) field, date/time sent and/orreceived fields, a folder path field, a client field, a cc field, a sizefield, a bcc field, and a subject field. An example email search page orform is illustrated in FIG. 3A.

By way of further example, the file search fields can include one ormore of a content field, a file name field, a file type field, a datefield, a file size field, a path field, an author field, a subjectfield, and so on. An example files search page or form is illustrated inFIG. 3B.

By way of example, the history search fields can include one or more ofa content field, a date/time (last viewed) field, a title field, a sizefield, a URL field, a format field, a category field, a domain field, alinks field, a media type field, a language field, and/or a search termfield. An example history search page or form is illustrated in FIG. 3D.The Internet search fields can include a content field, a URL field, atitle or Web site name field, a size field, a date field, a formatfield, a category field, a site field, a links field, and a languagefield, and/or a media type field. The favorites search fields caninclude one or more of a content field, a URL or address field, a lastrefresh field, a title field, and/or the fields discussed above withrespect to the Internet search fields. An example favorites search pageor form is illustrated in FIG. 3E.

Example email attachment search fields can include one or more of acontent field, a name field, a date/time field, a from field, a sizefield, a MIME tag field, an extension field, and/or path field. Anexample email attachment search page or form is illustrated in FIG. 3F.

Optionally, incremental indexing can be performed for files, emails,email attachments, Internet (or other network) search results,favorites, specific databases, and/or the like. Incremental indexing canbe performed in direct response to a user command, continuously,periodically, and/or upon the occurrence of one or more specifiedconditions, generates and/or updates a corresponding index. For example,Web pages can be indexed as soon as they are stored or cached.Advantageously, incremental indexing can be used to provide consistentlycurrent search indexes.

For example, with respect to emails an email index can be incrementallyupdated upon the receipt of new emails, at a specified recurring period(for example, every 60 seconds), at one or more specific times duringthe day, or when the host computer system is idle. In one embodiment,the host computer system is considered idle when one or more of thefollowing conditions are met: when central processor usage falls below afirst threshold; when the host computer bus utilization falls below asecond threshold, when a user interface device, such as a keyboard, hasnot been activated for a first period of time. In other embodiments,different criteria can be used to determine with a computer system isidle. In addition, the user can manually specify that indexing of newemail shall be immediately performed, or that all email should bere-indexed.

By way of further example, with respect to files, a files index can beupdated upon the creation, editing, or saving of a file, at a specifiedrecurring period (for example, every 60 seconds), at one or morespecific times during the day, or when the host computer system is idle.In addition, the user can manually specify that indexing of new filesshall be immediately performed, or that all files should be re-indexed.An example files incremental indexing options form is illustrated inFIG. 3J. Advantageously, the user can optionally specify differentindexing trigger conditions for different target types, such as foremail and files. In one embodiment, the user can optionally specifydifferent indexing trigger conditions for history, favorites, andattachments as well. Optionally, a server hosting a database can beconfigured to notify the search application of database changes, and thesearch application's index engine can index the changes substantiallyimmediately after such notification.

Advantageously, an embodiment of the search application 102 can becustomized by a user to meet the user's particular needs or preferences.For example, the user can select which email application or applicationsthe email search is to be performed in conjunction with. For example,the user can select Microsoft Outlook®, Outlook Express, Netscape® Mail,Eudora® and/or other email applications. In addition, in one embodiment,the user can select one or more Internet search engines to be used inperforming a search. As described above, the search engines can beexecuted by the host computer, or by a remote server. For example, inone embodiment the user can specify the Internet or other networksearches are to be performed using one or more of Google™, Teoma™, andAltaVista® search engines.

With respect to Internet, email, file, or other searches, the user canoptionally specify that only search results that are likely to berelevant are to be displayed. For example, if the user requests thatonly search results that are likely to be relevant are to be displayed,then only the higher ranked search results are displayed to the user.For example, only the top 20, 50, 100, 500, or 1000 search results maybe displayed. Relevancy can be determine using one or more techniques,such as whether an exact match is found for the search terms, how manytimes the search terms are found in a document, the spacing of searchterms in a document from each other, Optionally, the relevancy of eachfound document or file is scored, and the score for one or more of thefiles or documents are displayed to the user adjacent to thecorresponding search results.

In addition, the user can optionally specify which local and/or networkfolders are to be indexed. Further, the user can optionally specify, ona global or file-by-file basis, that the file contents and allattributes are to be indexed, or only specified attributes, such as filename and/or size, are to be indexed. The user can also specify, on aglobal or file-by-file basis, that only certain types of files, or fileswith certain extensions, are to be indexed. For example, a user canspecify that only .txt, .doc, .avi, .pdf., and .mpg are to be indexed.

FIG. 2C illustrates an example indexing process for a given target thatcan be performed using the indexing/scanning module. As discussed above,the target can be a document, email, file, Web pages, or the like.Optionally, the user can specify that one or more of the following areto be included in the history index and/or search: Web pages locatedusing the Web search interface corresponding to Web tab 310A, Web pagescorresponding to the user's favorites, Web pages directly addressedusing the address fields on any of the search interfaces, Web pagesaccessed via a separate browser, such as Microsoft Explorer.

In this example, the user has specified that indexing is to be performedwhen the host computer is idle. Beginning at state 202C, anindexing/scanning module determines if the host computer is idle. If thehost computer is idle, the process proceeds to state 204C, where thescan process begins. At state 204C index item entries are marked, via abit or flag associated with a given item, as unseen. As will bediscussed below, the “unseen” bit is used to determine if a given itemhas been deleted and so is to be removed from the index. At state 206C afirst item located in an appropriate directory on the mass storagedevice, which can be a disk drive, is located. At state 208C, becausethe first item was located, the corresponding index entry is now markedas seen by setting the corresponding flag or bit.

At state 210C the modification date and time, and/or creation date andtime, of a given item is compared to the date and time the index wascreated or last updated to determine if the file needs to be indexed orre-indexed. For, example, if a file has been modified or created sincethe last time the index was updated, then those files are to be indexed.

Optionally, for email files and/or other file types, theindexing/scanning module generates its own “change ID” for eachappropriate file. For example, for emails the change ID is generatedbased on email characteristics or attributes, such as subject, sendername, and/or date. By way of example, and not limitation, the change ID(“uid”) can be an aggregate of two or more of the receive time, size,subject, and sender name, as well as the containing folder id, andmessage-store ID as follows.

uid=entryid+‘|’+fullpathtocontainingfolder+‘|’+folderid+‘|’+storeid‘|’+client.

-   -   entryid:    -   for the Outlook client, entryid=the MAPI entryid.    -   for other clients, entryid=the “message id” that can be found in        the email headers,    -   and if that's not present, entryid is generated as follows:    -   entryid=receiveddata+‘|’+size+‘|’+subject +‘|’+sendername    -   folderid:    -   for Outlook, folderid=the MAPI entryid of the containing folder    -   for OE (Outlook Express), use an id from its store    -   for other email clients, use the full path to containing folder    -   storeid:    -   for Outlook, storeid=the MAPI store id of the containing message        store    -   for other clients, storeid is an empty string.

When performing state 210C during a scan operation, the index moduleexamines the previously calculated change ID for each document or file,generates a new change ID for the documents or files, and compares theprevious and new change IDs. If the change IDs are different for a file,then that file will be re-indexed.

At state 212C, the new and/or modified item indexed or re-indexed. Atstate 214C a determination is made as to whether there are additionalitems in the appropriate disk directories. If there are, the processproceeds from state 214C to state 206C, where the index process isrepeated for the next item. Once all items have been indexed, theprocess proceeds to 216C. The index item entries are examined todetermine if any of the entries are still designated as unseen,indicating that the item was not found during the scan and indexprocess. If the item is designated as unseen then it is inferred thatthe item has been deleted, and those unseen items are deleted from theindex.

As illustrated in FIG. 3A, one example search application interfaceincludes a control bar 302A, including search mode tabs for files 304A,email 306A, email attachments 308A, the Internet World Wide Web (“Web”)310A, prior Web search results (“History”) 312A, and favorites 314A.There is also an options button 316A that, if selected, provides theuser with a variety of forms that allow the user to customize the searchapplication 102 in accordance with the user's preferences and needs. Inaddition, there is a help button 318A, and a suggestion button 320A,which can be activated when the user wants to submit a suggestionregarding the search application 102 or improvements thereto.Advantageously, having the suggestion button 320A located on a maininterface makes it more likely users will provide helpful suggestions,as compared to having such a suggestion button located on a buried menuor rarely accessed interface.

The tabbed interfaces can have a main search field 322A, wherein theuser can enter a search string of one or more alphanumeric characters.The main search field is used to search for a string corresponding toone or more of the search fields associated with the search mode, andoptionally other attributes. Thus, for example, with respect to theemail tab 306A, the search application 102 will incrementally search forthe main search field string in the email contents, from (sender) field,to (addressee) field, date/time sent or received fields, folder pathfield, client field, cc field, size field, and bcc field.

As illustrated in FIG. 3A, the search results are displayed in a listarea or pane 324A, which includes a list of the search results,including relevant attributes in column format. In the exampleillustrated in FIG. 3A, the attributes include a from (sender) attributecolumn, a to (addressee) attribute column, date/time sent and/orreceived attribute columns, a folder path attribute column, a clientattribute column, a cc attribute column, a size attribute column, a bccattribute column, and a subject field. In this example, each of thecolumns has a corresponding search field 330A-348A.

In addition, a view area or pane 326A is provided that displays thecontent of the first item or user-selected item from the list pane orarea 324A. The user can print the search results displayed in the listpane or area 324A, as well as the document, email, Web page, file, andthe like, displayed in the view pane or area 324A.

Further, when the user clicks on a list pane entry, the correspondingcontents are displayed in the view pane 326A. If the user double-clicks,or presses the Enter button, on the list pane entry, the searchapplication causes the entry to launch in its native application. Forexample, if the user double-clicks on a list pane entry corresponding toa Word file, the file will open in Word. Similarly, if the userdouble-clicks on a Web page entry in the list pane 324A, the Web pagewill open in the user's default browser. If the user double-clicks anemail entry, the email will open in the user's default emailapplication, such as Microsoft Outlook®.

The user can also open the folder containing a file listed in the listpane by right-clicking on the file name in the list area 324A andselecting “View Folder”. Similarly, the user can open the foldercontaining an email listed in the list area 324A by right-clicking onthe email name in the list pane and selecting “Locate On Disk”. The usercan right-click to mark email listed in the list area 324A as read orunread by using the corresponding “mark read” or “mark unread” buttonsat the top of the view area 326A. Further, the user can activate theplus and minus keys (+/−) to move from one highlighted search term tothe next (or previous) in the view area 326A.

In one embodiment, the user can adjust the width of the list and viewareas or panes 324A, 326A by dragging a divider bar 328A located betweenthem to the left or right. In addition, rather than having the list andview areas 324A, 326A visually joined, the list and view areas 324A,326A can optionally float separately from one another, and the user canseparately drag the areas as desired on the user's display device. Theuser can move up, down, and between the list and view panes 324A, 326Ausing appropriate controls, such as, by of example, up, down, left,right arrow buttons, the tab button, and/or the PgUp (page up) and PgDn(page down) buttons.

In addition, the user can adjust the column widths in the list pane orarea 324A by dragging a corresponding column side. Further, the user canreorder the column placement by grabbing a corresponding column header,using, for example, the left mouse button, and then dragging the columnto a new location. Optionally, the user can also delete columns. Theuser can cause the system to reverse a search sort order by clicking ona user selected column header.

Example searches will now be described. The searches can be performedusing an index, such as that illustrated in FIG. 2A, and using thesearch process as similarly described above with respect to FIG. 2B.

By way of example, in order to perform an email search, a user can clickon the email tab 306A and bring up the email search interface page,illustrated in FIG. 3A. As illustrated in FIG. 4A, at state 401A thesearch system waits for a user input. The user input can be, forexample, a search character string or strings entered into the mainsearch field 322A, a search character string or strings entered into anattribute search field, a sort command, or an inverse sort command. Thesearch character or strings can include one or more alpha characters,numeric characters, words and/or phrases that appear in the emailcontents, from (addresser) field, to (addressee) field, date/time sentor received fields, folder path field, client field, cc field, sizefield, a bcc field, and a subject field of the email(s), that the useris searching for.

If the user has entered one or more characters or strings into the mainsearch field 322A the process proceeds to state 402A where the searchstring is received. The characters or strings can be typed in one at atime by the user or a string or strings of multiple characters can bepasted in with a paste operation. Proceeding to state 404A, a search isincrementally performed using each entered character by the searchapplication's local search engine, and the list pane or area 324Adisplays incrementally narrowed down search results. Thus, states 402Aand 404A take place at almost the same time. For example, state 404A maytake place a fraction of a second or a very few seconds after state402A. Similarly, a user can incrementally broaden the search results bydeleting or backspacing over previously entered search characters. Thesearch results are broadened substantially immediately in response tothe user deleting or back spacing over a character, without requiringthat the user click on or otherwise activate a “search” button or thelike.

If the user entered one or more characters or strings into one or morecorresponding column fields 330A-348A the process proceeds from state401A to state 406A where the search characters or strings are received.For example, if the user is looking for an email that was received inApril of 2003, the user can enter “2003-04” or a portion thereof intothe date/time field 332A. At state 408A, as the user enters charactersinto a column search field, the search is further incrementallyperformed by the search application's local search engine, and the listpane or area 324A displays incrementally narrowed down search results.Thus, the user can enter or delete search terms in the search fieldsabove each of the attribute data columns to incrementally limit orbroaden the results based on matches in that column for the search termentered. The search is performed using the email index, including bulkand fixed indexes.

The user can optionally select a highlight process, wherein search termswill be highlighted in the list pane or area search results and/or inthe view pane or area content display. The highlighting can be in theform of a different coloring, the use of different fonts, blinking text,and/or the like.

At state 410A, by clicking on an appropriate column header, the user cancommand the system to sort the search results based on a selectedcolumn. For example, the user can click on the Size header to sort theresults based on size. The sort can have been pre-computed as similarlydescribed above so that the sort can be immediately displayed. At state412 the search results are accordingly sorted. In one embodiment, thedefault sort orders displays the smallest size email first and thelargest size email last.

At state 414A, clicking on the column header a second time reverses thesort order displayed to the user. For example, if the user believes thatthe email being searched for is large in size, the user can click on theSize column header so that the larger files are displayed at the top ofthe column.

FIGS. 3G-H illustrate the email search interface during two differentstages of an incremental search. The main search field 322A has thecharacter string “2003-0” entered, the from field 330A has the characterstring “KU” entered, and folder field 334A has the character string “IN”entered, and the to field 338A has the character string “KM” enteredwith the NOT argument (−) preceding it. The search engine has narrowedthe emails listed in list pane or area 324A to the twelve that have thecharacter string “2003-0” in any of the emails' attributes or content,have “KU” in the from field, are located in a folder having thecharacter string “IN” in the folder name, and that does not have thecharacter string “KM” in the to field. The character strings areidentically entered in FIG. 3H, except the character string (“2003-07”)entered in the main search field 322A now has one additional searchcharacter (a “7”). Substantially immediately after the “7” was entered,the search application incrementally narrowed the number of emailslisted in the list pane 324A from the twelve illustrated in FIG. 3G tothe four emails listed in FIG. 3H.

By way of further example, in order to perform a file search, a user canclick on the file tab 304A and bring up the files search interface page,illustrated in FIG. 3B. As illustrated in FIG. 4B at state 401B thesearch system waits for a user input. The user input can be, forexample, a search character string or strings entered into the mainsearch field 322A, a search character string or strings entered into anattribute search field, a sort command, or an inverse sort command. Ifthe user entered search characters or strings into the main file searchfield 306B, the entry is received at state 402B. For example, the entrycan include one or more of alpha characters, numeric characters, wordsand/or phrases that appear in the file contents, the file name, the filetype, the file modification date field, the file size, or the file path.At state 404B, as the user enters characters into the main search fieldthe search is incrementally performed by the search application's localsearch engine, and the list pane or area 308B displays incrementallynarrowed down search results.

If the user has entered one or more characters or strings into one ormore corresponding column attribute fields the process proceeds fromstate 401B to state 406B where the search characters or strings arereceived. For example, the user can provide a relevant string ofcharacters in a corresponding column attributes field, such as a filename field 312B, a file type field 314B, a date field 316B, a file sizefield 318B, and a path field 320B. By way of example, if the user islooking for a document located in a “Documents” file on the user'scomputer “C” drive, the user can enter “C: Documents” or a portionthereof in the path field 320B. At state 408B, as the user enterscharacters into a column search field, the search is furtherincrementally performed by the search application's search engine, andthe list pane or area 308B displays incrementally narrowed down searchresults. The search is performed using the file index, includingcorresponding bulk and fixed indexes.

At states 410B, 412B by clicking on an appropriate column header, theuser can command the system to sort the search results based on aselected column. For example, the user can click on the Size header tosort the results based on size. In one embodiment, the default sortorder displays the smallest size file first, and the largest size filelast. At states 414B, 416B clicking on the column header a second timereverses the sort order displayed to the user. As discussed above, thesearch terms can optionally be highlighted in either or both the listpane or area 308B and the view pane or area 310B.

When searching the Internet and/or Web sites, a user can click on theWeb site tab and bring up the Web search interface page, illustrated inFIG. 3C. As illustrated in FIG. 4C, at state 402C, the user enter asearch query into a main search field 304C, including alpha characters,numeric characters, words and/or phrases that the user wants to searchfor in the Web site contents, the Web site name, and/or the Web siteURL. At state 404C, the user initiates the search, such as by activatingthe “Go” button or pressing the Enter button and the initiation commandis received by the search application.

At state 406C, the query is transmitted over a network, such as theInternet, to one or more search engines, such as Google™, Teoma™, andAltaVista®, which perform searches in parallel based at least in part onthe user entry. At state 408C, the search results from the one or moresearch engines are returned to the search application, and the searchresults are displayed in the list area or pane 306C and the Web pagecontents of a selected or highest ranked search result is displayed inthe view pane or area 308C. Optionally, some or all of the Web pagescorresponding to the search results are pre-cached on the user'sterminal 104 or on other storage local to the user. For example, the Webpages of the first twenty (or other predetermined quantity) listedsearch results can be pre-cached. In one embodiment, the maximum numberof pre-cached Web pages corresponding to the search results is set to beequal to the number of search results that can be displayed in the listpane.

At state 410C the search system waits for a user input. The user inputcan be, for example, a search character string entered into the mainsearch field, a search character string entered into an attribute searchfield, a sort command, or an inverse sort command. If the user enteredsearch characters or strings into the main file search field, the entryis received at state 412C. At state 414C, as the user enters charactersinto the main search field the search is incrementally performed by thesearch application's local search engine, and the list pane or area 308Bdisplays incrementally narrowed down search results.

At state 416C, the user can narrow the search results displayed byentering a relevant string of characters in a corresponding columnattributes field. The illustrated example includes a “results” columnfield 310C that can be used to search for all indexed attributes and/orcontent. Other embodiments can include additional or different columnsand fields, such as a URL column and field, a title or Web site namecolumn and field, a size column and field, a date column and field, aformat column and fields, a category column and field, a site column andfield, a links column and field, and a language column and field, amedia type column and field corresponding to the type of media used onthe page, and/or the like.

The format field and column are used to search for and categorize pagesaccording to the page or document type and/or extension. For example,the format can be a Word document (.doc), an Acrobat document (.pdf), ora PowerPoint document (.ppt). The site field and column are used tosearch for and categorize pages according to a Web sites top-leveldomain name (for example, www.zzzzzzz.com).

The category field and column are used to search for and categorizepages according to their genre or type, which can have descriptivelabels such as “hubs”, “personal home page”, “commercial”, and/or thelike. The search system can examine certain Web page and sitecharacteristics during the crawling of the Internet or thereafter, andbased at least in part on those characteristics, identify the Web pageas belonging to one or more categories. If the categorization isperformed before a user requests a search, a user search request,including a category term, can be performed much faster than if thecategorization was performed at the time of, or in response to the usersearch request.

For example, the category “hubs” or “directory” can correspond to Webpages that have many different links to other Web pages. In addition tothe quantity of links, the quality of links can also be taken intoaccount by the search system in categorizing a Web page as a hub ordirectory. The quality of the links can be scored based on a variety ofparameters, including the number of different domains the links link to,how many links are alive or broken (a broken link is a hyperlink thatdoes not work), and/or other parameters. Thus, for example, in oneembodiment, the higher the number of different domains the links link toand/or the greater the number or percentage of alive links, the higherthe quality score. If the score exceeds a predetermined threshold, thesearch system will categorize the page as a hub page.

With respect to the “personal home page” category, by way of example,the search system can look for certain key words or phrases, such a“family,” “children,” and/or “personal home page.” In addition, thesearch system can determine how many pictures or JPEG files are on thepage, and the level of sophistication of the HTML or other formattingcode used to create the page. Thus, for example, based on the number ofkey words, the number of pictures, and/or the sophistication of the HTMLcode, a determination is made as to whether a page is likely to be apersonal home page. In one embodiment, the greater the quantity ofpersonal-oriented key words and pictures, and the lower the codesophistication (for example, the absence or low level of script files,in line IFRAMES, custom tags), the greater the likelihood that the pageis a personal web page. A score can be assigned by the search systembased on such properties, and if the score exceeds a predeterminedthreshold, the search system will categorize the page as a personal homepage.

With respect to the “commercial” or business category, by way ofexample, the search system can look for certain key words or phrases,such a “price,” “quantity,” “stock,” “ship,” “corporation,” and/or“shopping cart.” In addition, the search system can determine the levelof sophistication of the HTML or other formatting code used to createthe page. For example, the greater the use of script files, in lineIFRAMES, custom tags, and the like, the greater the probablesophistication of the code. By way of example, based on the number ofkey words and/or the sophistication of the HTML code, a determination ismade as to whether a page is likely to be a commercial or business page.In one embodiment, the greater the quantity of business-oriented keywords, and the greater the code sophistication, the greater thelikelihood that the page is a commercial or business web page. A scorecan be assigned by the search system based on such properties, and ifthe score exceeds a predetermined threshold, the search system willcategorize the page as a commercial or business web page.

With respect to the links category, by way of example, the search systemcan rank pages based on the number of links and/or the quality of linksthat contain the search characters or strings. For example, if the userwas interested in searching for information on Idealab, the user canenter the term Idealab in the links field, the greater the number oflinks a page has to the Idealab.com domain, the higher the searchranking. By way of example, the quality of the link can be based on howmany links are alive or not broken and/or the quality of links on thelinked pages.

At state 418C, as the user enters characters or strings into one or morecolumn search fields, the search results displayed to the user areincrementally limited to those that contain the characters or stringsentered by the user. In one embodiment, this narrowing process does notneed to access remote search engines, as the search result informationand Web pages are already cached locally. At state 420C, by clicking onan appropriate column header, the user can command the system to sortthe search results at state 422C based on a selected column. At state424C, clicking on the column header a second time reverses the sortorder displayed to the user at state 426C.

The user can optionally print the information contained in the list paneor area 306C, as well as the Web pages displayed in the view pane orfield 308C. In addition, the user can enter a Web address directly inthe Address Field in the address field 312C, and then press the keyboardEnter button or the displayed Go button 312C to go to that page. Asdescribed below with respect to the favorites function, the user cancause the search application to add the address or URL to the user'sfavorites list by activating the plus (+) button 320C. Advantageously,the user can navigate through the found Web pages using the Forward andBackward control buttons 314C, 316C, and can refresh a Web page using aRefresh button 318C. The Backward control is configured to navigate to apreviously viewed favorites Web page and the Forward control isconfigured to navigate a next favorites Web page.

To search the results or history of prior Internet and/or Web searches,a user can click on the History tab 312A and bring up the history searchinterface page, illustrated in FIG. 3D. The history process copies Websites visited by the user, and indexes content and attributes forsubstantially instant searching of those historical Web pages. When theHistory interface is presented to the user, stored Web pages previouslyviewed in the search application's view pane or area 308D, andoptionally, stored Web pages previously viewed using other userbrowsers, are listed in the history list pane or area 306D. Optionally,based on a user-selected option, the view pane or area 308D either showspage corresponding to a select item as the user saw it when on thedate/time it was added to the history or the most recent version of thepage in Internet Explorer's cache. These pages can be displayed in theview pane or area 308D even when the user is offline.

As illustrated in FIG. 4D, at state 401D the search system waits for auser input. The user input can be, for example, a search characterstring entered into the main search field, a search character stringentered into an attribute search field, a sort command, or an inversesort command.

At state 402D, alpha characters, numeric characters, words and/orphrases, such as those that may appear in the Web page contents, the Website name or title, the Web page URL, the Web page size, the pagecategory, the page domain, the page language, the links on the page, thedata/time the Web page was last viewed, the search terms entered whenthe Web page was originally located by the search application, the nameof the browser or other tool used to originally locate the Web page,and/or the media type used on the page that are entered by the user intothe main search field 304D are received.

At state 404D, as the user enters characters into the main search field304D the search of the listed items is incrementally performed by thesearch application's search engine, and the list pane or area 306Ddisplays incrementally narrowed down search results. The view pane orarea 308D displays the selected page. The user can optionally specifywhether the view pane or area 308D will display the stored Web page, asthe user saw it when the Web page was added to the history, or the mostrecent version of the page stored in a local cache, such as Microsoft'sInternet Explorer's cache. Because, in one embodiment, the Web pages arecached, the previously viewed Web pages can be displayed in the viewpane or area 308 even when the user's terminal is offline.

The user can also provide a relevant string or strings of characters ina corresponding column field to specify certain search resultattributes, such as a content field, a date/time (last viewed) field310D, a URL field, 312D, a title field 314D, a size field 316D, a mediatype field (not shown), a search term field 318D, and/or the browseroriginally used field (not shown), a Web page size field, a pagecategory field, a page domain field, a page language field, and a linksfield, which are received at state 406D. At state 408D, as the userenters characters or strings into a column search field, the search isfurther incrementally performed by the search application's searchengine, and the list pane or area 306D displays incrementally narroweddown search results. At states 410D, 412D, by clicking on an appropriatecolumn header, the user can command the system to sort the searchresults based on a selected column. At states 414D, 416D clicking on thecolumn header a second time reverses the sort order displayed to theuser.

Optionally, the size of the history index can be limited based on sizeor date, which can be specified by the user. In one embodiment, thehistory functionality can integrated with the Web search tab, so that auser can search his/her history directly from the Web tab in addition tolive Web searches. Optionally, the user can also enter a URL or addressinto the address field 320D to browse to a new URL and have that pageadded to the history. As described below with respect to the favoritesfunction, the user can cause the search application to add the addressor URL to the user's favorites list by activating the plus (+) button322D.

With reference to FIG. 3E, the favorites function is used to select anddisplay Web pages that the user often refers to and so wants to accessquickly. The user can select an add favorites button, such as the plus(+) symbol in the history or Web interfaces, as illustrated in FIGS. 3Cand 3D, above a Web page to cause the search application to add it tothe user's favorites. In addition, the user can enter the desiredaddress or URL of a Web page into address field 316E, and click the plus(+) button 318E to add the address to the user's favorites list, orclick the minus (−) button 320E to remove the address displayed in theaddress field 316E from the user's favorites list. The Web pagesincluded in the user's favorites are cached, attribute and contentindexed, and once indexed, instantly searchable.

To search the cached favorites Web pages, a user can click on thefavorites tab 314A and bring up the favorites search interface page,illustrated in FIG. 3E. In response to the user selecting the favoritessearch mode, user favorites are listed in the favorites list pane orarea 306E. The view pane or area 308E either shows that page as the usersaw it when on the date/time it was added to the favorites or the mostrecent version of the page in Internet Explorer's cache. These pages canbe displayed in the view pane or area 308D even when the user isoffline.

As illustrated in FIG. 4E, at state 401E the search system waits for auser input. If the user provides an entry into a main search field 304E,such as alpha characters, numeric characters, words and/or phrases thatthe user thinks may appear in the favorites Web page contents, the Website name or title, and/or the Web page URL, the entries are received atstate 402E. At state 404E, as the user enters characters into the mainsearch field 304E the search is incrementally performed by the searchapplication's search engine, and the list pane or area 306E displaysincrementally narrowed down search results. The view pane or area 308Edisplays the selected page. The user can optionally specify whether theview pane or area 308E will display the stored Web page, as the user sawit when the Web page was added to the favorites, or the most recentversion of the page stored in a local cache, such as Microsoft'sInternet Explorer's cache. Because, in one embodiment, the Web pages arecached, the previously viewed Web pages can be displayed in the viewpane or area 308 even when the user's terminal is offline.

The user can also provide a relevant string of characters in acorresponding column field, to specify certain search result attributes,such as a content field, a date/time (last viewed) field 310E, a URLfield, 312E, a title field 314E, a size field 316E, a media type field(not shown), a search term field 318E, and/or the browser originallyused field (not shown), a page category field, a page domain field, apage language field, and a links field, which are received at state406E. At state 408E, as the user enters characters into a column searchfield, the search is further incrementally performed by the searchapplication's search engine, and the list pane or area 306E displaysincrementally narrowed down search results. At states 410E, 412E, byclicking on an appropriate column header, the user can command thesystem to sort the search results based on a selected column. At states414E, 416E clicking on the column header a second time reverses the sortorder displayed to the user.

With reference to FIG. 3F, the attachment function is used to locateemail attachments. To search for email attachments a user can click onthe attachments tab 308A and bring up the attachments search interfacepage which lists attachments in the attachments list pane or area 306F.As illustrated in FIG. 4F, at state 401F the search system waits for auser input. The user can enter into a main search field 304F, alphacharacters, numeric characters, words and/or phrases that the userthinks may appear in the desired attachment contents, name, date/timecreated, date/time received, from (sender) field, size, MIME tag,extension, and/or path, which are received at state 402F.

At state 404F, as the user enters characters into the main search field304F the search is incrementally performed by the search application'ssearch engine, and the list pane or area 306F displays incrementallynarrowed down search results. The view pane or area 308F displays theselected page.

The user can also provide a relevant string of characters in acorresponding column field to specify certain search result attributes,such as a content field (now shown), a name field 310F, a date/timefield 312F, a from (sender) field 314F, a size field 316F, a MIME tagfield 318F, an extension field 320F, and/or a path field 322F which arereceived at state 406F. At state 412F, as the user enters charactersinto a column search field, the search is further incrementallyperformed by the search application's local search engine, and the listpane or area 306F displays incrementally narrowed down search results.

At states 410F, 412F, by clicking on an appropriate column header, theuser can command the system to sort the search results based on aselected column. At states 414F, 416F clicking on the column header asecond time reverses the sort order displayed to the user.

In one embodiment, a search interface page includes a selection menuthat specifies which of email, attachments, files, the Web, favorites,and/or history (previously accessed and stored Web pages) targets, areto be searched. In accordance with the user selection one or more of theemail, attachments, files, the Web, favorites, and/or history, orcorresponding indexes will be searched. In this embodiment, the columnattribute search fields optionally will correspond to the selectedsearch targets. Thus, for example, if the user selects both email andfiles as search targets, the column attribute fields can include one ormore of a from (sender) field, a to (addressee) field, date/time sentand/or received fields, a email folder path field, a client field, a ccfield, a size field, and a bcc field, wherein the foregoing fields areappropriate for email searches, and one or more of a file name field, afile type field, a date last modified or created field, a file sizefield, and a file path field.

The search application 102 optionally ignores punctuation of a first setof punctuation when indexing and performing a search, such as an emailor files search, and includes punctuation of a second set of punctuationin indexing and search operations. For example, the first set ofpunctuation can include hyphens, commas and/or semicolons. The secondset of punctuation includes punctuation marks that are more likely to beuseful when performing a search, such as the period or dot (.), commonlyused in file extensions, and the “at” sign (@), which is commonly usedin email addresses. Optionally, the user can specify which punctuationmarks are to be included when indexing and performing searches, andwhich punctuation marks are to be ignored.

In one embodiment, when using a string of one or more characters as asearch string on which to perform a search, such as an email or filecontent search, the search string is treated as a prefix. That is, thesearch string should match the initial characters and/or words in apotential matching string for the file or email to be displayed in thelist pane or area. For example, if the search string is “act”, thesearch application 102 will list a file containing the word “actual”,but will not list a file if the only appearance of “act” is “fact.”

The user can enter multiple separate search words or strings and eachone will optionally be treated as a prefix. For example, searching “JohDo” can be used to locate the strings “John Doe”. Thus, one embodimentof the present invention provides multiword separate prefix searching.The searching can be performed incrementally. For example, if the userenter “Jo D” documents with the strings (“Jon” and “David”), (“Joe” and“Daniel”), (“John” and “Doe”) will be located. If user then adds an “h”to “Jo” so that the search strings are now “Joh D” then the list ofsearch results will be substantially immediately narrowed to presentonly documents having at least one string beginning with “Joh” and onestring beginning with “D”, such as a document containing (“John” and“Doe”). Further, the user can enter one or more additional searchstrings into one or more other search fields, such as one or more columnsearch fields and the search system will incrementally search fordocuments that have strings that correspondingly begin with the searchstrings or that are exact matches for the search strings.

Optionally, when certain standard characters are present in a givenattribute, such as the standard “www.” in a URL, the search applicationtreats the character following the standard characters as the firstcharacter. Thus, for example, when searching for the string “bloc” byentering it in the URL field 310E in FIG. 3D, the URL “www.loc.gov.com”would be identified as a match. Optionally, the user can specify thatthe search application 102 should look for the search string no matterwhere it appears in a file or in email content.

In one embodiment, the main search field and/or attribute fieldsdiscussed above can act as a command line interface, in addition toacting as search fields. Thus, before executing or applying a searchoperation on one or more search strings, a determination is made as towhether the first search string corresponds to a command or is includedin a list of command words. If the first word or string does correspondto a command word then a corresponding command file is read, and thecorresponding command is executed. If there is more than one word orstring entered into a search field, if the first word or stringcorresponds to a command word, then the subsequent words or strings areoptionally treated as command arguments or terms, if appropriate. Inaddition, command files can call other command files by including thecalled command file name and relevant arguments. The list of commandword can grow or shrink as the user adds or deletes commands. Thepresent invention is not limited to receiving commands into a searchfiled. In other embodiments, other data entry fields can be used toenter commands as well as data.

By way of example, the command words can include email (corresponding toa command to create an email using the following term to determine whothe addressee is), shop (corresponding to a command for finding thelowest online price for the product named in the following term), book(corresponding to a command for finding a book at a named Web site),movies (corresponding to a command for finding a listing of show timesat a named Web site), web (corresponding to a command for launching abrowser, and if there is a URL following the “web” then the browser islaunched to that URL), convert (corresponding to a command used toconvert from one set of units or currency to another), msword(corresponding to a command to launch msword, and if there is afollowing term, open or create a document with a name corresponding tothe term), programname (corresponding to a command to launch the programhaving the name programname), semail (corresponding to a command tosearch the email index using the following string(s)), sfiles(corresponding to a command to search the files index using thefollowing string(s)), and so on with respect to the other searchtargets.

The command names can correspond to command files, which for example,can have the extension “.do,” though other extensions can be used aswell. The command files can optionally be programmed, created and/oredited by an end user, thereby providing for significant customizing byusers and without requiring sophisticated programming skills. Further,the user can add additional commands as desired. In addition, thecommand names can be names selected by the user that are easy for theuser to remember, as opposed to arbitrary names, or difficult toremember names. The instructions included in the command files can beinstructions unique to the search application, instructionscorresponding to other applications, and/or instructions that correspondto operating system instructions. Thus, a search field can also be usedas a DOS, UNIX, LINUX, or other operating system command line. Thecommand file can also include a URL.

For example, if the user types “president bush” in the files search modemain search field, then the search application searches for a file thatincludes the terms “president” and “bush.” But if the user types “emailsamantha jones” the search application compares the first string,“email”, with the command list, recognizes that email is on the commandlist and that email is a command, retrieves an “email.do” command filethat specifies how the command is to be processed, accordingly processesthe term “samantha jones”, and generates a blank email to samantha jonesby looking up the email address for samantha jones in the user's emailapplication.

By way of another example, if the user types in “semail samantha jones”the search application compares the first string, “semail”, with thecommand list, recognizes that semail is on the command list and thatsemail is a command, processes the rest of the line as an “semail”command, in this case a search within the user's indexed email for thestrings “samantha” and “jones”.

Optionally, some of the commands can be built into the searchapplication, and do not utilize a separate command file. For example,the semail, sfiles, and/or web commands can be built in and not use aseparate command file.

Optionally, the results of the command are displayed using the list areaand/or the view area. For example, with respect to the “semail” command,the list and view areas are employed as similarly described above withrespect to FIG. 3A.

At times, the arguments following a user-entered command may result inambiguity in one of the arguments that are to be processed. For example,with respect to the convert command, where there is ambiguity, the usercan be presented with several options from which the user can select. Byway of example, if the user types “convert 5 miles” then the list areacan be used to display user selectable choices, such as “meters”,“feet”, and “kilometers”. If the user selects one of the choices, thesearch application will display the results in the view area. Thus, ifthe user selects “kilometers” the convert command with convert 5 milesinto corresponding kilometers, and displays the results in the viewarea.

In another example, a command can be used to quickly retrieveinformation from a specific URL via the search application. For example,the book command file can include the following:

-   -   web        http://search.barnesandnoble.com/booksearch/results.asp?WRD=$1

By way of further example, the movies command file can include thefollowing:

-   -   web http://movies.yahoo.com/showtimes/showtimes.html?z=$1

The parameter “$1” in the above examples is filled in by the followingstring entered by the user. For example if the user typed in the entry“movies lord” the movies command would launch a browser, access theyahoo movie show time site, submit the string “lord”, and the show timesfor movies that include the string “lord”, such as “Lord of the Rings”would be displayed to the user. If the string(s) following the commandword are not HTML compliant, such as containing a space, HTML-escapingis performed, and the string(s) that are in violation will optionally beconverted into an HTML compliant string, such a by replacing the spacewith a +sign. Thus, “lord rings” is optionally converted into“lord+rings.”

Advantageously, in certain embodiments, indexing of multiple forms of aword is performed, and where punctuation marks are in a word orcharacter string, the word and character string can be selectivelybroken into multiple words or strings which can reduce index size andfacilitate searching. For example, when indexing, a set of punctuationis optionally declared non-word symbols and are treated as spaces toseparate words. For example, this set can include: <( [ { “. Theforegoing example set members are selected because the included symbolsare often used as open-parentheses and thus, a following characterstring can often be considered to be a new word. In addition, thedouble-quote is rarely used as a word joiner. Some characters, such asthe <characters, are treated as a non-word symbol because they appearvery frequently in HTML code used to used to generate Web pages. Byexcluding such characters, strings such as word<ib></font> areadvantageously not treated as a word.

Optionally, punctuation not included in the set of excluded punctuationdiscussed above will be treated as the same as punctuation in the setonly if the punctuation is within N characters of the start of the word.For example, N can be set at 2 or at another value. Optionally,punctuation not included in the set of excluded punctuation will betreated as a space if the punctuation is after P characters of the startof the word. For example, P can be set at 3, or 5, or at another value.By way of further example, if N=2 and M=3, xy.com would be indexed asone word because the period is not within 2 characters of the start ofthe string and is not more than 3 characters after the string. Inanother example, if N=2 and M=3, nbc.org is treated as two words, nbcand org because the period is more than 3 positions away from the startof the string.

The values of M and P are selected so that the portion of a word orcharacter string before the punctuation is long enough, such that evenif the string is treated as two separate words or strings each stringwill be sufficiently unique or identifiable for searching purposes.However, if the punctuation occurs too near the beginning of a word orstring, treating the prefix as a separate word or string may not provideenough characters for a user to recognize the word or string. Forexample, if “a.m.” were broken into two one word strings, “a” and “m”,the search results would not be as useful or may not be useful at all.

Optionally, the index process indexes the individual words separated bypunctuation, so that for xy.com, xy and com are separately indexed.

By way of further examples, if N=2 and P=5, then for johndoe@zxcv.ca,all the punctuation marks are treated as spaces and the followingstrings are separately indexed: johndoe, zxvc, ca. Similarly, the stringme@mnbvcxza.net is treated as a word or a single string. For the string1,234,467, 1,234 is treated as a word or single character string, and467 is treated as a word or character string.

Further, a user can exclude files, emails, and the like that contain aspecified string from appearing in search results displayed to the user.For example, in one embodiment, the user can exclude files or emailcontaining a particular string by putting a minus sign at the beginningof the character string when entering it into the main search field orother relevant column field. If a user wants to search for a file thatdoes not contain a first string as a prefix or exact match, and doescontain a second string as a prefix or exact match, the user can put aminus sign in front of the string to be excluded, then follow thatstring by one or more spaces, and then the character string to beincluded. For example, if the user searches for an email that does notcontain, as a prefix or exact match, the string “Fred” (or “Frederick”,“Frederica”, or other string or word with “Fred”) and does include thestring “Sol” (or “Solly”, Sole, or Solomon”), the user would enter“-Fred Sol”.

Thus, as described above, certain embodiments of the present inventionprovide advantageously provide for incremental searching for a varietyof search targets. For example, the search targets can include one ormore of files, emails, email attachments, Web pages, specific databases,and/or the like. Because a search is performed incrementally, the searchresults are provided or narrowed substantially immediately after eachcharacter is a search string is entered by a user. Thus, the user isprovided with substantially immediate feedback as the search string isbeing entered, and so can quickly decide on the desirability of enteringadditional search characters, or of entering a new search string.

In addition, the user can specify that an exact word match is to beperformed. In one example embodiment, if a selected punctuation symbol,such as =, follows a word or character string directly, and is followedby a space, as in “mark=”, then the word is searched as an exact match.In addition, optionally, if a selected punctuation symbol, which can bea !, @, #, $, %, ^, &, *, ), ;, :, ', “, ?, >, and/or other punctuationsymbol, follows a word or character string directly, and is followed bya space, as in “mark$” or “mark.” or “mark-”, then optionally the wordis searched as an exact match including the punctuation. So “mark-” willnot match “market” or “mark. However, “mark-” will match “mark-down.”

In another embodiment, the search system described above isserver-based, and the user can access the search functionally using athin client. Thus, for example, the user does not need to download to orotherwise store the search application on the user's terminal. Instead,the user can access over a network a remote search system and engineusing a browser. The remote search system can be used to incrementallysearch content original to a Web site or database associated with theremote search system, and/or to search data that resides or originallyresided on another Web site or network site. Nonetheless, even thoughthe search engine is remotely located, once initial search results aretransmitted to the thin client, the used can quickly scroll down andview the content corresponding to the items in the search results as ifthe search application was hosted by the thin client.

FIG. 5 illustrates an example computer system that can be used toprovide a server based incremental search system. As depicted, oneembodiment includes a browser 504 stored on and executed by a userterminal 502. While in the illustrated embodiment the terminal 502 is athin client, it can optionally include a local index engine, localsearch engine, and email application similar or identical to thosediscussed above with reference to terminal 104 illustrated in FIG. 1. Inthe example embodiment, the browser 504 can interpret HTML tags, handleIFRAMES, and execute JavaScript. The terminal 502 is coupled to a remotesite 508 via a network 506, which can be, by way of example, theInternet. The terminal 502 can be coupled to the Internet using anetwork interface circuit and port.

The remote site 508 includes a computer server 515 that executes andhosts a search engine 510, an index engine 513, and a Web server 514.The computer server 515 includes one or more content and index databases512 and JavaScript, which will be discussed in greater detail below. Thesearch and index engines 510, 513 can be the same as or similar to thesearch engine 107, index engine 103, search engine 110, and index engine103 discussed above, and can provide the same or similar functionality.The Web site 108 and search engine 110 are accessible to the searchapplication 102 via the Internet 113. The remote site 508 can be coupledto the Internet using a network interface circuit and port. The Web site108 may optionally include content that spans multiple Internet domains,and/or may be implemented using physical servers that are geographicallyremote from one another. In other embodiments, the Web site 108 may bein the form of an intranet site, in which case the terminal 502 may becoupled to the site by a private network. For example, Web site 508 maybe in the form of an internal corporate store site for companyemployees.

By way of example, the search engine 510 may search local server-baseddatabases of the full text of Web pages selected from selected Webpages. The databases can optionally be generated using a spider or thelike that search for Web pages on other Web sites, such as Web sites518, 520, which are then stored in the databases, and that follow thelinks in those Web pages to other Web pages, which are then stored inthe databases. Because the Web page databases, or portions thereof, mayhave been generated before a user submits a search query, thesedatabases may not be fully current. When the search engine 510 performsa Web search in response to a user query, the search engine returnsexcerpts from relevant Web pages as stored in a corresponding database,and also returns links to the current version of the Web pages, if suchexists.

By way of further example, as similarly discussed above with referenceto FIG. 1, the server system can create indexes such as thoseillustrated in FIG. 2A using the process illustrated in FIG. 2C, cancreate prefix indexes, and can pre-compute sort lists.

FIG. 6 illustrates an example search process using the server basedsearch system. At state 602, the user accesses via the browser 504 theserver based system 508 over the Internet 506. At state 504, the serversystem 508 transmits HTML, and/or other formatting code, text, andJavaScript to the browser 504. JavaScript is a client-side scriptinglanguage that runs inside an Internet browser or Web client. JavaScriptcan be embedded in Web pages or in other HTML documents. JavaScript canbe used to enhance Web page dynamics and interactivity, and can be usedto perform calculations, check forms, generate new HTML pages usingvariable information, open and close windows and frames, and the like.When the browser loads a page that includes JavaScript code, thebrowser's interpreter reads and executes the JavaScript code.

At state 606 the HTML code is rendered by the browser 504, theJavaScript is executed, and the browser 504 displays the searchinterface as similarly discussed above with reference to FIGS. 3A-I.Thus, for example, the search interface can include a list pane or areaand a view pane or area, and well as the search area, including a mainsearch field and column attribute fields. As similarly discussed above,the user can select a particular target search interface by clicking onan appropriate tab or link. For example, a given target search interfacecan include a plurality of search fields, including a main search fieldand a plurality of search fields. Thus, as similarly discussed above, anInternet search interface can include one or more of a content field, aURL field, a title or Web site name field, a size field, a date field, aformat field, a category field, a site field, a links field, and alanguage field, and a media type field.

At state 608, the browser, via the executed JavaScript, monitors userinputs or deletions from the interface fields. For example, the user canenter into the main search field and/or into one or more attributesearch fields one or more alpha characters, numeric characters, wordsand/or phrases that the user thinks may appear in the target content orattributes. In addition, the browser monitors and detects whether theuser is paging up or down, or is scrolling off the page. Optionally, thebrowser continuously monitors and detects user inputs, deletions,scrolling, and/or paging up or down (collectively and individually alsoreferred to as “user changes”) during states 610-622, and if such isdetected, the process proceeds or returns to state 610. At state 610 thebrowser transmits the user changes to the server system 508.

At state 612 the server system 508 incrementally searches for Web pages,documents, or other files in accordance with the search character,characters, string, or strings received by the server system 508. Thus,with each additional character entered by the user and detected at state608, the search engine 510 will further refine the search. Similarly,with each deletion of a search string character, the search engine 510will reduce the amount of search filtering to broaden the search. Forexample, when the user types “fast”, four separate searches areoptionally sent to the server, one for “f”, one for “fa”, one for “fas”,and one for “fast”. Each time, the server finds all documents containingwords starting with that sequence of letters, and returns a list of thetop matches. At state 614 the server system 508 transmits a list ofsearch results to the browser 504.

In one embodiment, the JavaScript optionally instructs the server system508 how many results to return, so that only as many results as areneeded to fill the list pane are transmitted. Advantageously, thisreduces the load on the server system 508, increases the speed ofpresentation, and yet provides to the user the illusion that the listgoes on further. In such an embodiment, when the user presses the PgDnbutton, clicks a More button, or scrolls down the list, an additionalrequest is sent to the server for the next pagefull of results.

Optionally, to determine how long the transmitted search result listshould be, the server system 508 autodetects the number of lines thebrowser 504 can display at a time. Optionally, the user can instruct theserver system 508 on how long the list should be.

At state 616 the browser 504 displays the list in the list pane or area.At state 618, the browser 504 automatically selects the first item inthe search result list to be displayed in the content pane or area. Atstate 620 to accelerate the presentation of content in the view pane fora selected item in the list pane, the pages or documents correspondingto the search result entries in the list pane are preloaded as soon asthe list is received from the server system 508. The JavaScript loadsIFRAMEs for each of the Web pages corresponding to the list entries. Aninline floating frame or IFRAME is a construct which embeds a documentinto an HTML document so that embedded data can be displayed inside asubwindow of the browser's window. The IFRAME tag creates a floatingframe at its location in the HTML file. The SRC attribute, which sets orretrieves a URL to be loaded, specifies the content to be displayedwithin the frame.

Thus, the Web page content loaded onto user's computer 502, whilekeeping hidden the display of the Web page content. Then, to present thecontent or Web page corresponding to a selected search list entry, theJavaScript renders the corresponding IFRAME visible in the view pane.

FIG. 7 illustrates an example process of scrolling through a search listloaded via a browser as described above with reference to FIGS. 5 and 6.At state 702, the JavaScript detects when the user is scrolling up anddown the list displayed in the browser 504. At state 704, the browserselects the corresponding search list item for display in the view pane.At state 706 the browser 504 displays the selected search result contentthat had already been stored in an IFRAME in the view pane.Advantageously, because the content for each item in the list hasalready been preloaded, the scrolling and content display takes placesubstantially instantaneously and the Internet does not have to beaccessed.

Thus, as described above, certain embodiments of the present inventionprovide an efficient user interface, and advantageously provides forincremental searching for a variety of search targets using client-basedor server-based search applications. Because search can be performedincrementally, the user is provided with substantially immediatefeedback as the search string is being entered, and so can quicklydecide on the desirability of entering additional search characters.Advantageously, the user interface provides target specific attributesearch fields, thereby enabling a user to efficiently locate the desiredemail, files, attachments, Web pages, and the like. In addition, incertain embodiments, incremental indexing is performed, therebyproviding consistently current search indexes. Advantageously, incertain embodiments indexing of multiple forms of a word is performed,and where punctuation marks are in a word or character string, the wordand character string can be selectively broken into multiple words orstrings which can reduce index size and facilitate searching.

It should be understood that certain variations and modifications ofthis invention would suggest themselves to one of ordinary skill in theart. The scope of the present invention is not to be limited by theillustrations or the foregoing descriptions thereof.

1. A computer readable storage medium having instructions stored thereinthat are configured for execution by a computing device, theinstructions comprising: instructions configured to generate a pluralityof user selectable search mode interfaces, including at least: an emailsearch mode interface having email-specific attribute search fieldsincluding at least a date field, a from field, and a sender field; afile search mode interface having file-specific attribute search fieldsincluding one or more of a file name field, a file type field, a datefield, a file size field, and a path field; an email attachment searchmode interface including one or more of a name field, a date field, asize field, and an extension field; wherein each of the email searchmode interface, the file search mode interface, and the email attachmentsearch mode interface further comprises a list pane, used to display alist of search result items, and a view pane, used to display thecontents of a selected search result item; an index module that, whenexecuted by the computing device, generates at least an email index, afile index, and an email attachment index; and a search module that,when executed by the computing device, performs incremental searching ofat least one of the email index, the file index, and the emailattachment index in response to the user entering characters into atleast one search mode interface field.
 2. The computer readable mediumof claim 1, wherein the search module is configured to update the listof search result items after each character is received in one of thesearch mode interface fields without further input from the user.
 3. Thecomputer readable medium of claim 1, wherein the email search modeinterface, the file search mode interface, and the email attachmentsearch mode interface are alternatively viewable within a single searchinterface configured for display on a display device.
 4. The computerreadable medium of claim 3, wherein the single search interfacecomprises tabs that are selectable by the user in order to initiatedisplay of a respectively associated search mode interfaces, the tabscomprising at least an email search mode interface tab, a file searchmode interface tab, and an email attachment search mode interface tab,wherein each of the tabs is displayed in the single search interfaceregardless of which of the email search mode interfaces is currentlyselected for display.
 5. The computer readable medium of claim 1,wherein the search module is configured to automatically display in theview pane at least a portion of the contents of a first search resultitem in the list of search result items.
 6. The computer readable mediumof claim 5, wherein in response to the first of the list of searchresult items changing in response to updating of the list of searchresult items after a character is received in one of the search modeinterface fields, the search module is configured to automaticallyupdate the view pane to display at least a portion of the contents of anew first search result item in the changed list of search result items.7. The search apparatus as defined in claim 1, wherein the email indexcomprises email attributes including one or more of sender information,addressee information, date information, folder path information, clientinformation, cc information, and bcc information; the file indexincludes file attributes including one or more of file name information,file type information, date information, file size information, and pathinformation.
 8. A search apparatus, comprising: a computer readablestorage medium storing software modules for execution by the searchapparatus, the software modules comprising: an interface moduleconfigured to selectively generate a plurality of user selectable searchmode interfaces having attribute fields, including at least: an emailsearch mode interface having email-specific attribute search fieldsincluding one or more of a date field, a from field, and a sender field;a file search mode interface having file-specific attribute searchfields including one or more of a file name field, a file type field, adate field, a file size field, and a path field; an email attachmentsearch mode interface including one or more of a name field, a datefield, a size field, and an extension field; wherein each of the emailsearch mode interface, the file search mode interface, and the emailattachment search mode interface further comprises a list pane, used todisplay a list of search result items, and a view pane, used to displaythe contents of a selected search result item; an index moduleconfigured to generate at least a first index, including one or moreindexes of emails and files; one or more memories configured to storethe first index, including one or more indexes of emails and files; anda search module configured to perform incremental searching of the firstindex as the user enters characters into at least a first attributesearch field such that a list of search results is updated with one ormore of matching emails and matching files after each character of oneor more search strings is received from a user in one of the attributesearch fields.
 9. The search apparatus of claim 8, wherein the emailsearch mode interface and the file search mode interface each furthercomprise a list area used to display the list of search result itemsthat is updated as each character of the one or more search strings isreceived from the user.
 10. The search apparatus of claim 8, wherein theemail search mode interface further comprises a search field used tosearch across a plurality of the attribute fields.
 11. The searchapparatus of claim 8, wherein the email search mode interface and thefile search mode interface are alternatively viewable within a singlesearch interface configured for display on a display device.
 12. Thesearch apparatus of claim 11, wherein the single search interfacecomprises areas that are selectable by the user in order to initiatedisplay of a search mode interface corresponding to the respectiveselected area, the user-selectable areas comprising at least an emailsearch mode interface area and a file search mode interface area,wherein each of the user-selectable areas are displayed in the singlesearch interface regardless of which of the search mode interfaces iscurrently selected for display.